Corpus: hun-ro_web_2015_30K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1 to 5) found at sentence endings in the first K sentences

K        # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
100              94            99             99            99            99
1000            846           992            997           997           997
10000          7037          9688           9968          9982          9987
100000        18522         28447          29774         29898         29925
1000000       18522         28447          29774         29898         29925
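
For readers who want to reproduce counts of this kind, the following is a minimal sketch, not the tool that generated this page. It assumes whitespace tokenization, treats the last N tokens of each sentence as its ending N-gram, and skips sentences shorter than N; the function name `ending_ngram_counts` and its parameters are hypothetical. When K exceeds the number of sentences available, the sketch reports the totals for the whole input, which would be consistent with the identical last two rows above.

```python
def ending_ngram_counts(sentences, ks=(100, 1000, 10000), max_n=5):
    """Count distinct sentence-final N-grams (N = 1..max_n) after the first K sentences.

    Returns a dict mapping each K to a list of max_n counts.
    Assumption: tokens are separated by whitespace.
    """
    seen = [set() for _ in range(max_n)]  # one set of endings per N
    targets = sorted(ks)
    results = {}
    for i, sentence in enumerate(sentences, start=1):
        tokens = sentence.split()
        for n in range(1, max_n + 1):
            if len(tokens) >= n:  # sentences shorter than N have no N-gram ending
                seen[n - 1].add(tuple(tokens[-n:]))
        if i in targets:
            results[i] = [len(s) for s in seen]  # snapshot after K sentences
    # If the input has fewer than K sentences, report the totals for all of it.
    total = [len(s) for s in seen]
    for k in targets:
        results.setdefault(k, total)
    return results


if __name__ == "__main__":
    demo = ["a b c .", "x b c .", "x y c !"]
    for k, counts in ending_ngram_counts(demo, ks=(2, 10), max_n=4).items():
        print(k, *counts)
```

The sets are kept across sentences so that each K row is a cumulative count, matching the way the table grows with K.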


Zipf's diagram for sentence endings

[Figure: Gnuplot diagram]
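
A Zipf diagram conventionally plots frequency against rank on logarithmic axes. The sketch below shows one way such rank and frequency pairs could be derived for sentence endings. It is an illustration under assumptions: the page does not state which unit is plotted, so the sketch uses the final whitespace-separated token of each sentence, and the function name `zipf_points` is hypothetical.

```python
from collections import Counter


def zipf_points(sentences):
    """Return (rank, frequency) pairs for sentence-final tokens, most frequent first.

    Assumption: the ending of a sentence is its last whitespace-separated token.
    """
    endings = Counter()
    for sentence in sentences:
        tokens = sentence.split()
        if tokens:  # ignore empty lines
            endings[tokens[-1]] += 1
    freqs = sorted(endings.values(), reverse=True)
    return list(enumerate(freqs, start=1))


if __name__ == "__main__":
    # Two-column output (rank, frequency), one point per line.
    for rank, freq in zipf_points(["a .", "b .", "c !"]):
        print(rank, freq)
```

The two-column output can be plotted directly in Gnuplot after `set logscale xy`.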

Processing time: 2387 msec (2018-04-27 17:12)